Modeling of sentence-medial pauses in bangla readout speech: occurrence and duration
نویسندگان
چکیده
Control of pause occurrence and duration is an important issue for text-to-speech synthesis systems. In text-readout speech, pauses occur unconditionally at sentence boundaries and with high probability at major syntactic boundaries such as clause boundaries, but more or less arbitrarily at minor syntactic boundaries. Pause duration tends to be longer at the end of a longer syntactic unit. A detailed analysis is conducted for sentence-medial pauses for readout speech of Bangla. Based on the results, linear models (with variables of syntactic unit length and distance to directly modifying word) are constructed for pause occurrence and duration. The models are evaluated using the test data not included in the analyzed data (open-test condition). The results show that the proposed models can predict occurrence probability for 87% of phrase boundaries correctly, and pause duration within ±100 ms for 80% of the cases.
منابع مشابه
Analysis of Influence of L2 English Speakers' Fluency on Occurrence and Duration of Sentence-medial Pauses in English Readout Speech
Pause plays important roles for the intelligibility, naturalness and fluency of speech. This paper reported the effect of native (L1) Bengali speakers’ fluency of English on occurrence probability and duration of sentencemedial pauses with respect to three factors: phrase type, phrase length (l), distance (d). In this analysis, 40 nonnative (L2) English (L1 Bengali) speakers’ data was divided i...
متن کاملFactors Affecting the Occurrence and Duration of Sentence-medial Pauses in Japanese Text Reading
Pauses play important roles for the intelligibility and naturalness of speech. Their occurrence and duration in text reading are influenced by syntactic structures of the text as well as by physiological constraints of respiration on the part of the speaker. In contrast to sentenceand paragraph-final pauses, sentence-medial pauses are influenced by a number of factors. Analysis of Japanese news...
متن کاملA Forced - alignment - based Study of Declarative Sentence - ending in
The phonetic characteristics of declarative sentence-ending ‘ta’ was examined based on the speech of one male Korean speaker in his 20s drawn from a large-scale speech corpus. An analysis using a Unicode-based phone alignment system was compared to an analysis based on manually corrected alignment and the two methods produced largely comparable results. The declarative-ending ‘da,’ which coinci...
متن کاملSpeech pauses and gestural holds in parkinson²s disease
Parkinsons disease (PD) belongs to a class of neurodegenerative diseases that affect the patients speech, motor, and cognitive capabilities. All three deÞcits affect the multimodal communication channels of speech and gesture. We present a study on the changes in speech pause patterns and gesture holds before and after treatment. We present the results of a pilot study of two Idiopathic PD pa...
متن کاملAllophone-based acoustic modeling for Persian phoneme recognition
Phoneme recognition is one of the fundamental phases of automatic speech recognition. Coarticulation which refers to the integration of sounds, is one of the important obstacles in phoneme recognition. In other words, each phone is influenced and changed by the characteristics of its neighbor phones, and coarticulation is responsible for most of these changes. The idea of modeling the effects o...
متن کامل